Next Generation Optical Character Recognition using the Polynomial Method

نویسنده

  • Sam Danziger
چکیده

This project extends on the work done Drs. Anderson and Gaborski’ in Optical Character Recognition (OCR) of handwritten digits done back in the 1990s. Their work involved using the polynomial discriminant method, iterative training, and genetic algorithms to optimize the feature selection. This project involves several variations in technique over previous work. The first is the use of the matrix manipulation language Matlab rather than ’C’ This allowed the faster training of the matrices (due to highly optimized matrix manipulation functions) and rapid debugging due to a built in debugger and other high level language advantages. This project also utilized a less random form of feature selection than the ”knight moves” or other feature selection techniques used previously. This technique involves compressing the digits from 30X20 pixels into 8X5 and 6X4 ”fat pixels” and then taking all possible combinations of pixels between the two compressed digits (resulting in 960 features) This project also explored the creation of a network of classifiers trained to differentiate between between all possible pairs of digits (0,1), (7,8), et. al. This resulted in 10C2 = 45 classifiers used in tandem to recognize the hand written digits. The Polynomial Classifiers initially created were very similar to those produced previously with one major exception: When the 960 features were reduced to 300 features using the genetic algorithm, the weight matrix proved to be surprisingly skilled at identifying previously unseen digits. For example, when classifying digits not used to train the matrix, the 960 features produced a 43% accuracy while the 300 features produced a 75% accuracy. By using a larger training set, and training for more iterations, this accuracy was improved to nearly 80% correctly classified digits. The 10C2 = 45 network classifier performed best of all classifying up to 83% of unseen digits correctly, opening up a whole new area for further research.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optical Character Recognition on Grid and Multi-core Systems – Performance Analysis

Original scientific paper Requirements of digital libraries for computing power, as well as needs of smaller users for automated document processing and digitization are growing, making grid environments and multi-core systems a preferred platform for optical character recognition applications. This paper examines character recognition performance on the CRO-NGI grid and on two multi-core syste...

متن کامل

Character Recognition U sing Canonical

This paper presents a new insight into character recognition problem. Implicit polynomial (IP) curves have been used for modelling characters. A unique decomposition theorem is employed to decompose these curves into simple line primitives. For the comparison of the characters, canonical invariants have been computed using so called "related points" of the curves, which are the real intersectio...

متن کامل

Character Recognition Using Canonical Invariants

This paper presents a new insight into character recognition problem. Implicit polynomial (IP) curves have been used for modelling characters. A unique decomposition theorem is employed to decompose these curves into simple line primitives. For the comparison of the characters, canonical invariants have been computed using so called “related points” of the curves, which are the real intersectio...

متن کامل

Next Generation Tele-Teaching: Latest Recording Technology, User Engagement and Automatic Metadata Retrieval

With the latest technological development in the last decade, new opportunities for learning environments and educational systems, such as tele-teaching, arose. Nowadays recording technology includes easy and fast workflows, high definition video recording, multiple sources and diverse output formats. With the amount of tele-teaching content growing, issues with sufficient metadata start existi...

متن کامل

Two Dimensional Principal Component Analysis for Online Tamil Character Recognition

This paper presents a new application of two dimensional Principal Component Analysis (2DPCA) to the problem of online character recognition in Tamil Script. A novel set of features employing polynomial fits and quartiles in combination with conventional features are derived for each sample point of the Tamil character obtained after smoothing and resampling. These are stacked to form a matrix,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002